Conversation

@jp-agenta
Member

release/v0.61.1

Copilot AI review requested due to automatic review settings November 10, 2025 14:23
@dosubot dosubot bot added the size:XXL This PR changes 1000+ lines, ignoring generated files. label Nov 10, 2025
@CLAassistant
CLAassistant commented Nov 10, 2025

CLA assistant check
Thank you for your submission! We really appreciate it. Like many open source projects, we ask that you all sign our Contributor License Agreement before we can accept your contribution.
1 out of 2 committers have signed the CLA.

✅ jp-agenta
❌ GitHub CI


GitHub CI does not appear to be a GitHub user. You need a GitHub account to be able to sign the CLA. If you already have a GitHub account, please add the email address used for this commit to your account.
You have signed the CLA already but the status is still pending? Let us recheck it.

Contributor

Copilot AI left a comment

Pull Request Overview

This PR bumps the version from 0.60.2 to 0.61.1 and introduces significant enhancements to the evaluation system, particularly around custom evaluations, evaluator revisions, and metric handling. The changes span the web frontend (both OSS and EE editions) and the Python SDK.

Key changes include:

  • Introduction of "custom" evaluation type throughout the codebase alongside existing "auto", "human", and "online" types
  • Implementation of evaluator revision fetching and merging logic to support version-based evaluator definitions
  • Enhanced metric column factory with improved slug resolution, nested metric support, and better type inference from statistics
  • SDK workflow improvements making slug parameters optional for built-in evaluators
  • New SDK model structures for evaluations, testsets, and git-based artifact management
  • Improved metric key normalization and fallback resolution in the focus drawer
  • CSV export functionality for custom evaluations
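
As a rough illustration of the "optional slug" change listed above, the sketch below shows one way a workflow helper could accept an optional slug and derive one when it is omitted. The names `slugify` and `resolve_slug`, the `"workflow"` fallback, and the random-suffix scheme are hypothetical and chosen only for this example; they are not taken from `sdk/agenta/sdk/utils/references.py` or `sdk/agenta/sdk/workflows/builtin.py`, whose actual behavior may differ.

```python
import re
import uuid
from typing import Optional


def slugify(name: str) -> str:
    """Lowercase the name and collapse non-alphanumeric runs into single hyphens."""
    return re.sub(r"[^a-z0-9]+", "-", name.lower()).strip("-")


def resolve_slug(name: str, slug: Optional[str] = None) -> str:
    """Return the caller's slug if one is given, otherwise derive one from the name.

    Hypothetical helper: the generated form (name-based prefix plus a short
    random suffix) is an assumption made for this sketch.
    """
    if slug:
        return slug
    base = slugify(name) or "workflow"
    # Short random suffix to reduce collisions between generated slugs.
    return f"{base}-{uuid.uuid4().hex[:8]}"
```

With a helper along these lines, a caller that passes `slug="my-eval"` keeps that value unchanged, while a caller that omits it gets a generated slug prefixed with the normalized name.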

Reviewed Changes

Copilot reviewed 113 out of 118 changed files in this pull request and generated 55 comments.

| File | Description |
| --- | --- |
| `web/package.json` | Version bump to 0.61.1 |
| `web/oss/package.json` | Version bump to 0.61.1 |
| `web/ee/package.json` | Version bump to 0.61.1 |
| `web/oss/src/state/evaluators/atoms.ts` | Added evaluator revision fetching and merging logic with new utility functions |
| `web/oss/src/state/app/hooks.ts` | Enhanced app filtering logic to exclude SDK evaluation apps |
| `web/oss/src/state/app/atoms/fetcher.ts` | Added filtering and new app detail query atom |
| `web/oss/src/lib/hooks/useEvaluators/types.ts` | Added EvaluatorRevisionDto types |
| `web/oss/src/lib/hooks/useEvaluators/index.ts` | Removed unused rest parameter |
| `web/oss/src/lib/Types.ts` | Fixed semicolon formatting |
| `web/ee/src/lib/metricColumnFactory.tsx` | Major refactor with nested metric support, improved slug resolution, and type inference |
| `web/ee/src/components/pages/evaluations/*` | Added "custom" evaluation type support throughout |
| `web/ee/src/components/HumanEvaluations/assets/utils.tsx` | Enhanced metric collection and evaluator slug resolution |
| `web/ee/src/components/EvalRunDetails/*` | Updated to support custom evaluation type |
| `sdk/pyproject.toml` | Version bump and dependency updates |
| `sdk/agenta/sdk/workflows/utils.py` | Renamed PARAMETERS_REGISTRY to CONFIGURATION_REGISTRY |
| `sdk/agenta/sdk/workflows/builtin.py` | Made slug parameter optional for all built-in workflows |
| `sdk/agenta/sdk/utils/references.py` | New utility for slug generation |
| `sdk/agenta/sdk/utils/client.py` | New authenticated API client utility |
| `sdk/agenta/sdk/models/*` | New model files for workflows, evaluations, testsets, git, and blobs |

@dosubot dosubot bot added size:S This PR changes 10-29 lines, ignoring generated files. and removed size:XXL This PR changes 1000+ lines, ignoring generated files. labels Nov 10, 2025
@jp-agenta jp-agenta enabled auto-merge November 10, 2025 14:40
@jp-agenta jp-agenta merged commit d5a9463 into main Nov 10, 2025
5 of 6 checks passed
Labels

Backend, SDK, size:S, tests


4 participants